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Abstract 

In the capacitated vehicle routing problem, introduced by Dantzig and Ramser in 1959, we are given 
the locations of n customers and a depot, along with a vehicle of capacity fc, and wish to find a minimum 
length collection of tours, each starting from the depot and visiting at most k customers, whose union 
covers all the customers. We give a quasi-polynomial time approximation scheme for the setting where 
the customers and the depot are on the plane, and distances are given by the Euclidean metric. 

1 Introduction 

In 1959, Dantzig and Ramser introduced the vehicle routing problem (VRP) and gave a linear pro- 
gramming based Algorithm whose "calculations may he readily performed by hand or automatic digital 
computing machine"[%]. Since its introduction, VRP has been widely studied by researchers in Opera- 
tions Research and Computer Science. Several books (see [16], [10] and [9], among others) have been 
written on the VRP and it even has its own wikipedia page [19]. 

VRP is used to describe a class of problems where the objective is to find low cost delivery routes 
from depots to customers using a vehicle of limited capacity. When Dantzig and Ramser first introduced 
VRP they stated that "no practical applications have been made as yet"[S]. Since that time, applications 
of various VRP problems have been identified in numerous industries where transportation costs matter 
such as food and beverage distribution, and package and newspaper delivery. Toth and Vigo report on 
businesses that saved between 5 % and 20% of total costs by solving VRP problems using computerized 
models [16]. 

The capacitated vehicle routing problem. We study the most basic form of the vehicle routing 
problem, the capacitated version (CVRP), where the input is n -I- 1 points representing the locations of 
n customers and one depot, and a vehicle of capacity k. The objective is to find a collection of tours, 
starting at the depot and visiting at most k customers, whose union covers all n customers, such that 
the sum of the lengths of the tours is minimized. CVRP is also referred to as the fc-tours problem in 
the Computer Science literature [1, 4]. We focus on the 2d-Euclidean version of CVRP where customers 
and the depot are on the Euclidean plane. 

Previous work. Several results are already known about the approximability of CVRP. When the 
capacity of the vehicle k is 2, it can be solved in polynomial time using minimum weight matching. 
The metric case was shown to be APX-complete for all 3 < fc; Asano et al. presented a reduction from 
H-matching for k = 0(1) and there is a simple reduction from the traveling salesman problem (TSP) 
for larger k [5]. Constant factor approximation with performance (1-1- a) with a being the best constant 
factor approximation for TSP, were presented by Haimovich and Rinnooy Kan [11]. 

The existence of a PTAS for the 2rf-Euclidean version remains an active area of research. Extending 
[11], Asano et al. presented a PTAS for the case k — 0(log7i/ loglogrt) [4]. Arora's work implies a 
PTAS for k = n{n) [4]. 

Our result. We present a quasi-polynomial time approximation scheme for the entire range of k. 



Theorem 1.1. (Main Theorem) Algorithm 1 is a randomized quasi-polynomial time approximation 
scheme for the 2d-Euclidean capacitated vehicle routing problem. Given e > 0, it outputs a solution with 
expected length (1 + 0(e)) OPT, in time n'°s°'^^''' The Algorithm can be derandomized. 

Where previous approaches fail. Our approximation scheme uses the divide and conquer approach 
that Arora used in designing a PTAS for Euclidean TSP [1]. Like Arora, we "divide" the problem using 
a randomized dissection that recursively partitions the region of input points into progressively smaller 
squares. We search for a solution that goes back and forth between adjacent squares a limited number 
of times and always through a small number of predetermined sites called portals that are placed along 
the boimdary of squares. It is natural to attempt to extend the TSP structure theorem and show that 
there exists a near optimal solution that crosses the boundary of squares a small number of times, and 
then use dynamic programming. Unfortunately [2], 

"we seem to need a result stating that there is a near-optimum solution which enters or leaves 
each area a small number of times. This does not appear to be true. [...J The difficulty lies in 
deciding upon a small interface between adjacent squares, since a large number of tours may 
cross the edge between them. It seems that the interface has to specify something about each 

of them, which uses up too many bits. " 

Indeed, to combine solutions in adjacent squares it seems necessary to remember the number of 
points covered by tour segments, and that is is too much information to remember. 

Overview of our approach. To get around this problem we remember approximately how many 
points are on each tour segment. Specifically we round the number of points on each tour segment down 
to a multiple (1 + e/ log n) , which means there are only 0(log k) possibilities for the approximate number 
of points on each tour segment. This enables us to deal with the difficulties described by Arora: now we 
have a small interface between adjacent squares, namely, for every pair of portals and every approximate 
number of points, we remember the number of tour segments that have this profile. The quasipolynomial 
running time of our dynamic program (DP) follows since the number of profiles is polylogarithmic and 
as there are at most n tour segments of each profile. 

One challenge with only remembering the approximate number of points per tour segment is that 
the tours found by our dynamic program (DP) could now cover a httle bit more than k points. To 
deal with this issue, whenever a tour of the DP solution covers more than k points we carefully choose 
enough points and drop them from the tour to make it feasible. Finally we compute a solution just on 
the dropped points using a 3-approximation [11] and prove that it has negligible length compared to 
OPT. 

In section 2 we present our approximation scheme and prove its correctness under the assumptions 
that the DP solution is near optimal (Theorem 2.2, proved in section 4) and that the 3-approximation 
solution on the dropped points has length at most 0(e)OPT (Theorem 2.5, proved in section 5). Section 
3 presents the DP and section 6 the derandomization. 

Related Work. Our work builds on the approach that Arora [1] used in designing a PTAS for the 

geometric traveling salesman problem. Similar techniques were also presented by Mitchell [13]. Recently 
these techniques have been applied to design approximation schemes for several NP-Hard geometric 
problems, including the polynomial time approximation schemes for Steiner Forest [6], and i^-Median 
[12] and quasipolynomial time schemes for Minimum Weight Triangulation [15] and Minimum Latency 
problems [3] among others. See [2] for a survey of these techniques and a discussion about generalizing 
them to other problems. 

2 The Algorithm 

An overview of our approximation scheme is given in Algorithm 1. We use a quasipolynomial time DP 
to find a near optimal solution, OPT^^, which may include some tours that cover more than k points. 
We drop points from each infeasible tour of OPT^^, choosing the points carefully using a randomized 
procedure to obtain the set of feasible black tours. A solution on the dropped points, the red tours, is 
obtained using a 3-approximation. We output the union of the red and black tours. 
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Algorithm 1 CVRP approximation scheme 

Input: n points in R,^ and an integer k. 

1: Perturb instance, perform random dissection and place portals as described in section 2.1. 
2: Use the DP from section 3 to find OPT^^ which is defined in subsection 2.2. 

3: Trace back in the DP's history to construct tours and assign types to points using the randomized type 

assignment from subsection 2.4. 
4: A point is black if has type -1 and red otherwise. Drop all red points from the OPT^^ tours. 
5: Get a solution for the red points using the 3-approximation Algorithm from subsection 2.3. 

Output: the union of the red tours on the red points and the black tours on the black points. 



2.1 Preprocessing [1] 

Perturbation. Algorithm 1 will work on a perturbed instance. Following Arora's approach from [1], 
we perturb and scale so that all points fit into a square bounding box of side length L = 0{ti) where L 
is a power of 2, and so that all points have integer coordinates and the distance between any two points 
is either or at least 4. A solution for the perturbed instance can be extended to the original instance 
using additional length 0(e) OPT. See appendix A for the details. Prom this point, OPT will denote 
the optimal CVRP solution to the perturbed instance. 

Randomized Dissection. Using Arora's divide and conquer paradigm from [1], we perform a ran- 
domized dissection of the input by recursively partitioning the bounding square into 4 smaller squares 
of equal size using one horizontal and one vertical dissection line. The recursion stops when the smallest 
squares have size 1x1. We define levels for the squares and the dissection lines. See appendix A for 
the details. The maximum level of any square or line is imax = O(logn). The probability that a line I 
becomes a level £ dissection line in the randomized dissection is 

Pr(level(Z) = £) = (1) 

Portals. As in [1], we place points called portals on the boundary of dissection squares that will be the 
entry and exit points for tours. Let m = 0(logn/e) and a power of 2. Place 2^m portals equidistant 
apart on each level £ dissection line for all £ < £max- Since a level £ line forms the boundary of 2^ level 
£ squares there will be at most at Am portals along the boundary of any dissection square 6. As m and 
L are powers of 2, portals at lower level squares will also be portals in higher level squares. 

Definition 2.1. (Portal respecting and light) A tour is portal respecting if it crosses dissection lines 
only at portals. A tour is light if it is portal respecting and crosses each side of a dissection square at 

most r = 0{l/e) times. 

Arora showed that for TSP there exists a near optimal solution which is light. 

Theorem 2.1. [1] Let OPT{T SP) denote the optimal solution for an instance of Euclidean TSP and 
let D he its randomized dissection. With probability at least 1/2 there exists a salesman tour of length 
(1 + 0(e))OPT that is light with respect to dissection D. 

2.2 The Structure Theorem 

We formally define the solution computed by our DP. Recall that remembering the exact number of 
points on tour segments may require too many bits. Instead, we define 0(logA;) thresholds that are 
multiples of (1 + e/logn) in the range [1, k\ and round down the number of points on each segment to 
its closest threshold. Given a tour segment covering a non-threshold number of points x with t being 
the closest threshold value less than x, to "round" the segment we set the type of exactly x — t points to 
indicate they should be dropped from the segment. When two tour segments each covering a threshold 
number of points are concatenated, the new segment may have to be "rounded" again as the sum of two 
thresholds do not necessarily add to another threshold. The DP works bottom-up in the dissection tree 
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and may have to "round" tour segments at each level of the tree. It will mark the points that need to 
be dropped at level £ by setting their type to I. Points with a type in [0, Imax] are dropped from the 
final tours. 

Definition 2.2. (Thresholds, Types, and Rounded Segments) 

• Let T = log(i_|_£/iog„)(A;/e). We define a sequence 0/ r + 1 thresholds .• t^ = 1, t\ = 1/e, 
ti = l/e(l + e/logn)^-i, . . . , tr = k. 

• Assum,e that each point has a type which is an integer in [—l,£max]- We say that a point is active 
at level i if its type is strictly less than £. 

• Let -K = {-Ki) be a set of tours. For any Wi and any dissection square b at level £, a segment is 
a connected component of Hi Ob. A scgm.ent is cither rounded or unrounded, with the following 
property: for each rounded segment s, there is a threshold ti such that s covers exactly ti active 
points. 

The DP builds tours are allowed to cover more than k points and thus in one sense the DP solves 
a relaxed version of CVRP. To ensure that the additional cost of making the DP solution feasible small 
compared to OPT, we only round tour segments inside a dissection square when there are many, at least 
7 (defined below), segments entering the square in which case the cost of going from the depot to the 
dropped points in the square can be charged to OPT. If there are only a few, less than 7 per threshold, 
tour segments entering the square, we can afford to remember the exact number of points per segment. 
The intuition behind the third part of definition 2.3 is to limit the number of points that are marked to 
be dropped from each tour. 

Definition 2.3. (Relaxed CVRP) A relaxed CVRP is a set of tours such that there exists an assignment 

of types to the points with the following properties: 

1. Each tour covers the depot, at most k points of type —1, and possibly some points of type > — 1. 
The union of the tours covers all n points. 

2. Let 7 = log^n/e*. In any dissection square b for each threshold value ti (i < t — 1) there are at 
most 7 unrounded segments covering between [ti, i^+i) points. The number of rounded segments in 
b covering exactly ti points that are active at level{b) is an integer multiple ofj. 

3. Let b be a dissection square and let s be a tour segment in b, which has t points that are active at 
levelib). Then segment s has at most t{l + e/ logn) active points at level{b) + 1. 

We apply the concept of being light to each tour of the solution individually rather than to the 

solution as a whole. 

Definition 2.4. Let D be a dissection and let S be a CVRP solution consisting of tours (tTj). S is called 
i-light if each tour ttj is light. 

We extend the objective function to include a penalty for tour crossings: 

Definition 2.5. (Extended Objective Function) Let n = (tTi) be a set of tours. For every level i let 

c{Tri,£) be the number of times tour iTi crosses the boundary of level £ squares, and dg = L/2^ be the 
length of a level £ square in the dissection. The extended objective function is: 

= lengthim) + -A^ V V c{-Ki, I) ■ di, (2) 
^ log n , „ 

Theorem 2.2. (Structure Theorem) In expectation over the shifts of the random dissection, the i-light 
and relaxed CVRP solution which minimizes the extended objective function F given in equation 2 has 
length {l + O{e))0PT. 

We prove Theorem 2.2 in section 4. Let OPT^^ denote the relaxed and i-light CVRP solution that 
minimizes the extended objective function F given in equation 2. We present a DP to compute OPT^^ 
in section 3. 
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2.3 A constant factor approximation [11] 

For a solution of the red points we use the constant factor approximation of Haimovich and Rinnooy 
Kan which partitions a traveUng salesman tour of the points into tours that cover at most k points [11]. 
Algorithm 2 below is a version of the Algorithm presented by [11] and it is known to be a 3-approximation. 

Algorithm 2 TSP Partitioning 3-approximation [11] 

Input: n points and the depot in IR^ and an integer k. 

1: Let TT denote a tour of input points and the depot obtained using a 2-approximation of TSP. 
2: Choose a point p uniformly at random from it. 

3: Go around vr starting at p, and every time k points are visited, take a detour to the depot. 
Output the resulting set of |^n/A;J + 1 tours as the CVRP solution. 



Theorem 2.3. [11, 4J Let I denote the set of input points, o the depot, and d{i,o) denote the distance 
of point i from the depot. Define Rad{I) = '^/kJ2iei d{i,o) and let T5P(/U {o}) denote the length of 
the minimal tour of I and o. We have that: 

• Rad{I) < OPT, 

• TSP{lU{o}) < OPT and 

• in expectation Algorithm 2's solution has length Rad{I) + 2TSP{IU {o}) < 30PT. 
2.4 Assigning Types 

The 3-approximation on the red points has small length if the Rad and TSP of the red points have 

small value. To ensure this is the case, when selecting y points to drop from segment S, wc choose y 
such that length(y) < 0(e)length(5'), and such that average distance of points in y to the depot is only 
a 0(e) fraction of the average distance of points on segment S to the depot. Section 5 shows that both 
conditions hold if y is chosen as follows: 

Algorithm 3 Randomized Type Assignment Procedure 

Input: A tour segment S from a level £ square b containing Sa active points and requiring y active points 

to be dropped 

1: Select one active point p uniformly at random from Sa 

2: Starting at point p, select the next y — 1 points from Sa that lie consecutively after p on the segment 5; 
if the last active point on the segment before 6's boundary is reached without having selected y points, 
wrap around and select active points from the other end of the tour segment (after S enters 6) 

3: Label each of the y chosen points with type £. 



2.5 Proof of Theorem 1.1 

Lemma 2.4 proves the correctness and running time of a DP that computes OPT^^. See section 3 for 
its proof. 

Lemma 2.4. (Dynamic Program) Given the set of input points and a randomly shifted dissection, the 
dynamic program of Section 3 finds an i-light and relaxed CVRP solution that minimizes the objective 
F defined in Equation (2) in time n^°s°'^'^'' 

The length of the solution output by Algorithm 1 will be the sum of the lengths of the black tours 

and red tours. Since the black tours are obtained by dropping points from the DP solution they have 
length at most OPT^''^, which by Theorem 2.2 is at most (1 + 0(e))OPT. Theorem 2.5, which is proved 
in section 5, shows that the length of the red tours is 0(e)OPT. 
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Theorem 2.5. In expectation over the random shifts of the dissection and the random type assignment 

the length of the red tours output by Algorithm 1 is 0{e)OPT. 

Thus the solution output by Algorithm 1 has total length (1 + 0(e))OPT. The DP dominates the 
running time. The derandomization of the Algorithm is discussed in section 6. 

3 The Dynamic program 

This section presents the quasi-polynomial time dynamic program and proves Lemma 2.4. The dynamic 
program (DP) finds OPT^^, the relaxed and i-light solution that minimizes the extended objective from 
Equation (2). 

The DP table. A configuration C of a dissection square 6 is a list of entries: for each pair of portals 
p, q, the configuration has two sublists of entries, one to record information about rounded tour segments 

and one to record information about unrounded tour segments: 

1. first sublist: (r^'"^, . . . , r^'', . . . r^''), where r^''^ is the number of rounded tour segments that use 
portals p and q and cover exactly ti active points 

2. second sublist: (w?'', . . . , w^'', . . . , u^'-t), where u^'' is the number of active points covered by the 

j-th unrounded tour segment that uses portals p and q 

The DP has a table entry Li,[C] for each dissection square b and each configuration C of h. The table 
cell Lh[C] stores the minimum cost (according to objective F as defined in Equation 2) of placing tour 
segments in & in a way which is compatible with C and with the relaxed and i-light Definitions 2.3 and 
2.4. OPT^^ is the minimum table entry over all configurations of the root level square. 

Computing the table entries. The table entries are computed in bottom-up order, in the following 
manner. 

Inductively, let 6 be a square at level £ and let 6i, 62, ^3, ^4 be the children of b at level £ + 1. Since 
every tour is i-light, a tour segment inside b crosses the boundaries inside b between b\,b2,bz, &4 at most 
4r times, and always through portals. Thus the segment is the concatenation of at most 4r -|- 1 pieces, 
where a piece goes from some portal TOj to some portal m^+i in one of the children of b. By the relaxed 
definition applied to level l+l, each piece is either rounded or one of the 7T unrounded tours inside a 
child of b. Thus every piece can be described by a tuple (p, q, x), where p, q are portals and x is either one 
of the rounded threshold values ti for some i < t or a number j < indicating it is the j-th unrounded 
tour in a child square of b. The profile $ = (p, mi, 712), (mi, m2, rii), . . . {my,p', ny){f) of the segment is 
the list of those 4r-|-l tuples (representing tour segment pieces), plus a fiag / which is true iff the segment 
is rounded. Consider a profile $ of 6 with flag / = 1. Suppose that the concatenation of the pieces 
described by $ contains x active points. Let ti be the threshold value defined by: ti < x < ti{l+€/ logn). 
Then the DP counts this segment as having ti active points. 

Let D denote the number of possible profiles for a segment in square b. For each profile $, let ritj, 
denote the number of tour segments in b with profile An interface vector I = (n<i>)<i> is a list of D 
entries. Intuitively the vector I, provides the interface between how tour segments in b are formed by 
concatenating the segments of 5's children. 

Let Co be a configuration for square b. The calculation of Lb (Co) is done in a brute force manner by 
iterating through all possible values of the interface vector / and all possible combinations of configu- 
rations in 6's children, Ci, C2, C3, C4. A combination Co, I, Ci, C2, C3, C4 is consistent if / describes at 
most 7 • T unrounded segment and if gluing Ci, C2, C3, C4 according to / yields configuration Cq. 

The cost of a consistent combination is computed in the following way: The cost of configurations 
Ci, . . .C4 is stored in lookup tables Lb^[Ci), 1 < i < 4. Let denote the total number of tour segments 
in b as specified by / (also specified by Co). The value of objective function F, as defined by equation 
2 of (Ci, C2, C3, C4, /) is the sum of the costs of Cj for child square bi, plus (e/ log^ n)2c{,(/). The value 
of Lb{Co) is given by the tuple (Ci, C2, C3, C4, /) consistent with Cq with minimum cost. 

Analysis of the dynamic program. How many possible configurations are there for a square 6? 
There are O(log^n) different pairs of portals (p, g); for each {p,q), there are r entries r^'^ in the first 
sublist and r7 entries u^'^ in the second sublist. Thus a configuration of 6 is a list of 0(r7log^n) 
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entries. Each entry (the number of segments r^'' or number of points u^'*) is an integer between and 
n, thus the total number of configurations for square b is ri'^^'^''''"^^ = n'~'(^°s^^ . As there are O(n^) 
dissection squares, the DP table has size n^^^°^ overall. 

How many possible profiles are there for a segment in square 6? $ has a list of 0{r) tuples {p,p', x). 
There are 0(log^ n) choices of portals p,p' and 0{tj) choices of x, so there are 0{t"/ log^ n) possibilities 
for each tuple. The flag doubles the number of profiles so there are D = (log^^ n)*-'^''^ possible profiles. 
As r = 0(l/e), D = log'^^^/^^ n. 

How many possible interfaces arc there for a square 6? At most , since each n$ is in [0, n]. This 
means we have only a quasi-polynomial number of possibilities for the interface vector / for square b. 

Checking for consistency takes time polynomial in the size of the list of entries in / and C,. 

There arc " possible values for each Ci and " possible values for I. Thus in total it 

takes time polynomial in n'°s^^*^'''' " to run through all combinations of /, Ci, C2, C3, C4 and to compute 
the lookup table entry at Lfc[Co]. 

Remark. The DP verifies the existence of a type-assignment satisfying definition 2.3 but does not 

actually label points with a specific type. It merely records the number of active points that tour 
segments it constructs should have. Once the cost of OPT^^ solution is found, we can trace through 
DP solution's history, and find a valid type assignment by looking at the decisions made by the DP. In 
fact the type assignment can be done during the same time that the tours of OPT^^ are constructed. 
For example while constructing OPT^^, if we build a tour segment with x active points at level £ but 
the DP's history recorded the segment as having t active points, we can choose any x — t active points 
from the segment and label them with type £. ^ 

4 Proof of Structure Theorem 

Let OPT^ denote the i-light solution of minimum length. In Lemma 4.2 we show that F(OPT^^) < 
(1 + 0(e))OPT^, where F is defined in equation 2. As OPT^^ is at most F(OPT^^), we get that 
QprpDP < (1 + 0(e))OPT^. Then we apply Corollary 4.1, given below, to show that OPT^^ < 
(1 + O(e)OPT) and prove our structure Theorem. Corollary 4.1 is a simple generalization of Arora's 

structure Theorem that shows that OPT^ is near optimal. 

Corollary 4.1. (Generalization of Arora) In expectation over the random shifts of the dissection, 
E[OPT^] < {l + O{e))0PT 

Proof. Let OPT^ consist of a set of tour tt = tti, . . . Wm- Apply Arora's structure Theorem 2.1 to each 
tour, sum, and use linearity of expectation. □ 

Lemma 4.2. In expectation over the random shifted dissection, F{OPT^^) < (1 + O{e))0PT^ . 

Proof. To start comparing OPT^^ and OPT^, we apply Lemma 4.3 to turn OPT^ into a solution that 
satisfies the relaxed definition 2.3. See appendix B for proof of Lemma 4.3 

Lemma 4.3. Let S be a CVRP solution on input I . There exists an assignment of types to points, thai 
turns S into a solution that satisfies definition 2.3. The tours of S are not modified and the length of S 
remains unchanged. 

Now OPT^^ and OPT^ are both i-light and relaxed solutions. As OPT^^ also minimizes objective 
function F, we have 

F(OPT^^) < F(OPT^) = OPT^ + (e/log^ n) ^ c{-K^,(.)de 

level e 



^Labelling any active points on the segment with type I will satisfy the relaxed CVRP definition. But we use the randomized 
type assignment procedure 3 to ensure that a solution on these points will have small cost. 
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where tt^ are the tours of OPT^ . Now we only need to show that the last term summing the number of 
crossings in OPT^ is 0(e)OPT^ in expectation. Lemma 4.5 allows us to bound the number of crossings 
in OPT^ in terms of the number of crossings in OPT, and Lemma 4.4 allows us to charge each crossing 
to the length of OPT. See appendix C for the proofs. 

Lemma 4.4. In expectation over the random dissection, for any level i OPT> 0{de)E[{c{iT,£)]. 
Lemma 4.5. For a random dissection at any level i, E[c{'jt^,£)] < 0{logn)E[c{TT,£)]. 
Applying Lemma 4.5 we get, 

(e/log'n) Yl E[c{n'',£)]-di<{e/log^n) 0{logn ■ di)E[c{7r,£)] (3) 

level i level £ 

By Lemma 4.4, E[c{n, e)]0{de) is at most OPT and as irnax = O(logn), Equation 3 is 0(e)OPT. 
This proves the Lemma as OPT < OPT^. □ 

5 Proof of Theorem 2.5 

Let R denote set of the points marked red by Algorithm 1. By Theorem 2.3, the 3-approximation on R 

has cost at most Rad(i?) + TSP(i? U {o}), where o is the depot. Lemmas 5.1 and 5.2 proves Theorem 
2.5 by showing that in expectation both quantities are 0(e)OPT. 

Lemma 5.1. In expectation over the random type assignment, Rad(R) = 0{e)OPT 

Lemma 5.2. In expectation over the random dissection and type assignment TSP{RU {o}) = 0{e)OPT 

5.1 Properties of the randomized type assignment procedure 

We state some properties that will be useful in proving the Lemmas 5.1,5.2. See appendix D for the 
proofs. 

Let 6 be a level £ square containing points labelled type i by Algorithm 1 and S be a a rounded 
tour segment inside b. Let Rs = ri,r2, ■ ■ - rj, be the interval of points labelled type £ on S {\Rs\ may be 
zero) and Sa = si,S2, ■ ■ ■ s^ be the active points on S prior to rounding segment S. We now list some 
Properties of the rounded segment S. 

Property 5.3. < • 0(e/logn) 

Property 5.4. A point s G Sa is in Rg with probability \Rs\/\Sa\- 
Property 5.5. E[length{Rs)] < length{Sa) ■ 0(e/logn) 

5.2 Proof of Lemma 5.1 

Proof. Recall that, Rad(i?) = 2/A:^^g^d(a;,o), where d{x,o) is the distance of point x from the depot. 
By Theorem 2.3 Rad(/) < OPT, so it is sufficient to show that Rad(i?) < 0(e)Rad(/). Fix any 
level £ of the dissection and let Rf be the set of points which were assigned type £. We show that in 
expectation Rad{Rf ) < 0(e/ log n)Rad(/). The claim follows by linearity of expectation (over all levels) 
since Rad(i?) = levei ; Rad(i?£). 

Partition Rg according to the tour segment it is from: Rj C Si,R^ C S2, ■ ■ ■ -R™ ^ '^m where R-'^ is 
the set of red points from tour segment Sj. By definition we have that 

2 ™ 

Rad(/)>-^^(i(o,x) (4) 
j=i xeSj 

As Re is picked randomly, and by Properties 5.4 and 5.3, Pr[a; e R^] < 0(e/logn), so we get 

2 m ^ m 

E[Rsid{Re)] = -^YYl ^) ^^[^ ^ ^e]< ^YYl ^) ' *^(^/ ") 

j=l xGSj j xGSj 
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Combining this with equation 4 we get that £'[Rad(-R^)] < 0(e/logn)Rad(/). □ 
5.3 Proof of Lemma 5.2 

Proof. Let Re be the points labeled type £ at level £. We show that E[TSP{ReU{o})] < 0(e/logn)OPT. 
This implies Lemma 5.2 since the tours of {Rg U {o}} from all levels can be pasted together at the depot 
to yield a tour of (R U {o}). 

Let Be be the squares at level i containing points of Re- We consider the cost of TSP(i?^ U {a}) in 
two parts: the outside and inside costs, where intuitively the outside cost will be the cost to get to the 
squares Be from the depot and the inside cost will be the cost of visiting the red points inside the square, 
focusing on the outside cost, let C be a set of points containing at least one portal from each square of 
Be such that MST{C U {o}) is minimized^. The optimal tour of Rf U {O} is at most 2MST{C U {o}), 
thus 

TSP{Re U {o}) < 2MST{C U {o}) + ^ inside cost of b (5) 

beBe 

Claims 5.6 and 5.7 prove that the quantities on the right hand side of equation 5 are both 0{e/ log n)OPT, 
proving that TSP{Re U {a}) < 0{e/ logn). □ 

Claim 5.6. In expectation over the random shifted dissection, E[2MST{C U {o})] < 0{e/\ogn)OPT. 

Proof We will show that, MST{C\J {a}) < 0(e/logn)OPT^^. The claim follows, as OPT^^ < 
(1 + 0(e)OPT, by the structure Theorem 2.2. 

Consider the fully connected graph G with one vertex for each point in C and one more for the 
depot. Define the weight of an edge of G to be the distance between the two vertices connected by that 
edge. Consider the following linear program, A with value v on G. 



V = mm{we ■ Xe) s.t. 



EeeSiS)^e >7/4r VSCV 

Xe > 



DP 



As each h £ Be contains some points labelled I (i.c at least a group of rounded segments), OPT 
contains are least 7 tour segments crossing into b. Since each tour in OPT''^'^ is i-light, there are at least 
7/4r tours entering b. Thus OPT^^ has at least 7/4r edges crossing any cut separating the depot from 
a point in C. As v is the minimum cost way to have at least 7/4r edges cross all such cuts, OPT'^'^ > v. 

Now consider the linear program A' below. A' is the relaxation of the IP for MST. Let v' be the 
value of A' on graph G. 



v' = miniwe ■ Xe) s.t. < 
I 



Eee5(S) > 1 yS CV 

Xe > 



Observe that for any solution x of A, x' = x ■ Ar is a solution for A'. As A and A' have the same 
objective, v ■ 4r/7 = v'. The MST relaxation A' is known to have integrality gap at most 2 [17], so that 
v' >2 MST (C U {o}). Thus we have that 

QprpDP > „ = ^' . (^/4^) > MST{G U {o}) • (7/2r) 
Thus (2r/7) • OPT^^ > MST{G U {o}) and as 2r/7 = o(e/ logn), the Claim is proved. □ 

Claim 5.7. In expectation over the random dissection and the random type assignment the total inside 
cost at level I is at most 0{e/ logn) OPT. 



C is used only for the analysis and does not need to be found explicitly 
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Proof. The inside cost at level I is the sum of the inside costs of each square h ^ Bg. The contribution of 
square b & B^io the inside cost, is the sum the length of the intervals of type ^ points inside b plus the 
cost of connecting these intervals to the boundary of b. In Claim 5.8 we show that, in expectation over 
the random type assignment, the sum over all squares in Be of the length of intervals of type I points 
is 0(e/logn)OPT^^. The type £ intervals inside each b ^ Bf must be connected to each other and to 
the boundary of their square. We refer to CC{i) as the total the connection cost at level i. CC{£) is 
the sum of the length of the boundaries of each square b E Bf plus the cost of connecting the type i 
intervals inside each b e B^ to the boundary of b. Claim 5.9 shows that CC{£) = 0{e/ logn)F(OPT^'^). 
By Lemma 4.2 and Corollary 4.1 F(OPT^^) < (1 + 0(e))OPT in expectation, proving this Claim. □ 

Claim 5.8. In expectation over the random type assignment the length of all intervals of type £ is 
0{e/\ogn)OPT"^ . 

Proof. Sum the lengths of intervals of type £ over all squares in B^ squares, use Property 5.5 and linearity. 
See appendix D for a detailed proof. □ 

Claim 5.9. The total connection cost for level £ is O{e/logn)F{0PT^^). 

Proof. Let CC{£) denote the total connection cost which is the cost of connecting the type £ intervals 

inside each b € Bg to the boimdary of the square. Focus on one square b G Bf . As OPT^^ is relaxed, b 
has gi) ■ 7 rounded segments for some integer gi, > 0. Consider a group of 7 rounded segments. Let R' 
be a set containing one type £ point from each of the 7 segments in the group. The cost to connect all 

red intervals in the group with the boundary of the square is at most MST{R') + 5 ■ df , where dg is the 
side length of square b. We bound this using the following bound for TSP [11] [4]. (See [11] for a proof). 

Theorem 5.10. Let U be a set of points in 2d-Euclidean space. Let dmax be the max distance between 
any two points of U. Then TSP{U) = 0{dmax^/W\) 

In our context, dmax = de, and U = R'. Since |J?'| = 7, \U\ = 7. By Theorem 5.10 we have that 
MST{R') + 5 • = 0{de, ■ -^7). This holds for each of the g^ groups of rounded segments inside h thus 
we have that the total connection cost for b is 0(5b • dg ■ ^J^). 

Let gi = X^fceB^ 9b^ be the total number of groups of rounded segments at level £. The total connection 
cost for level £ is the sum of the connection costs over all squares b G B^, 

CC{£) = 0{ge -df^) (6) 

Each of the g^ ■ 7 rounded tour segments intersects twice with the boundary of a level £ dissection 
square, thus: c{ir^^,£) > ^gej, with ir^^ being the tours of OPT^^. Using equation 6, 

0(d,/V7)-c(7r^^,^)>CC(^) (7) 

For objective function F, (defined in 2), we have (log^n/e) • i^(OPT^^) > c{n^^,£)de. Substituting it 
for c{'K°P,£)dt in equation 7 we get, 0(l/^)(log^ n/e) • F(OPT^^) > CC{£). As 7 = log'^n/e^ this 
reduces to 0(e/logn) • F(OPT^^) > CC{£), which proves this Claim. □ 

6 Derandomization 

Arora's dissection can be derandomized by trying all choices for the shifts a and b. More efficient 
derandomizations are given in Czumaj and Lingas and in Rao and Smith [7, 14]. As for the randomized 
type assignment Algorithm 3, to guarantee that the cost of the dropped points is small, when selecting an 
interval Y to drop from a segment 5, we only need to pick Y such that (1) Rad(y) < 0(e/ logri)Rad(5) 
and (2)length(F) < 0(e/ logn)length(S'). In Lemma 5.1 and Property 5.5 we prove that these two 
conditions hold at the same time, in expectation when Y is chosen by first selecting a point uniformly 
from S and then selecting the next |y| — 1 consecutive points. To derandomize we can test the at most 
\S\ intervals of length |F| in S, (each starting from a different point in S), and select any interval that 
satisfies these two conditions. 
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A Technical Tools 



The Perturbation Define a bounding square as tlie smallest square whose side length L is a power 
of 2 that contains all input points and the depot. Let d denote the maximum distance between any two 
input points. Place a grid of granularity de/n inside the bounding square. Move every input point to 
the center of the grid square it lies in. Several points may map to the same grid square c;enter and we 
will treat these as multiple points which are located at the same location. Finally scale distances by 
An/{ed) so that all coordinates become integral and the minimum non-zero distance is least 4. 

A solution for the perturbed instance can be extended to a solution for the original instance by taking 
detours from the grid centers to the locations of the points. The cost of such detours will be at most 
n ■ \/2de/n. As the two farthest points must be visited from the depot we have that 2d < OPT. Thus 
the total cost of the detours is < eOPT and is negligible compared to OPT. Note also that scaling does 
not change the structure of the optimal solution. After scaling the maximum distance between points 
is 4n/e which is 0(n) for constant e. 

Randomized Dissection A dissection of the boimding square is obtained by recursively partition- 
ing a square into 4 smaller squares of equal size using one horizontal and one vertical dissection line. 
The recursion stops when the smallest squares have size 1x1. The bounding square has level 0, the 4 
squares created by the first dissection have level 1, and since L = 0{n) the level of the 1x1 squares will 
be imax = O(logn). The horizontal and vertical dissection lines are also assigned levels. The boundary 
of the bounding square has level 0, the 2'~^ horizontal and 2'"^ vertical lines that form level i squares 
by partitioning the level i — l squares are each assigned level i. A randomized dissection of the bounding 
square is obtained by randomly choosing integers a,b € [0, L), and shifting the x coordinates of all hor- 
izontal dissection lines by a and all vertical dissection lines by b and reducing modulo L. For example 
the level horizontal line is moved from L/2 to a + L/2 mod L and the level vertical line is moved to 
b + L/2 mod L. The dissection is "wrapped around" and wrapped around squares are treated as one 
region. The probability that a line I becomes a level £ dissection line in the randomized dissection is 

Pr(level(Z) = I) = 2^/L (8) 



B Relaxed CVRP 

Proof. (Proof of Lemma 4.3) We give an type assignment procedure which initially assigns all points 

to type —1 and never sets any point to a type below —1. As S* is a valid CVRP solution all tours in 
S contain at most k points of type —1 satisfying the first condition of definition 2.3. The procedure 
works in a bottom up fashion in the dissection tree from level ^max to level 0. At the current level (. 
consider each dissection sqiiarc one at a time. For any threshold value ti for i < t while square b has 
at least 7 unrounded tour segments, select exactly 7 such segments and perform a group-rounding as 
follows: Examine each of the 7 segments one at a time. If the unrounded tour segment has x active 
points with ti < x < ti+i, pick any x — ti of these active points and label them as type Perform 
as many group-rounding steps as necessary until square b has at most 7 * r unrounded tours. Proceed 
similarly to the other squares at level 

The type assignment procedure docs not change the construction of any of the tour in S thus the 
cost of S is unchanged. Now we show that definition 2.3 is satisfied. While working at a level I, the 
procedure performs group-rounding on each square at level t until the square has at most 7*r unrounded 
segments. As the groiip-rounding rounds exactly 7 segments togethcir there will always be an integer 
multiple of 7 rounded segments in each square. Condition (ii) continues to hold while the procedure 
works on levels j < ^ as in those levels the procedure only labels points with type j < i so the number 
of active points at level £ remains the same. As for the third condition of definition 2.3, note that before 
rounding a segment at level ^, all points on the segment have type —1 or a type greater than ^. Thus 
prior to rounding the segment will have x active points at level £ and at level £+1. Let ti < x < ti+\. 
To round the segment at £, label x ti points with type £. This still leaves ti active points at level £ 
and X active points at level I + 1. As ti{l + e/ logn) = tj+i > x, the third condition of definition 2.3 is 
satisfied. □ 
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C Extensions of Arora's Results to CVRP 



Let t{'KjJ,) denote the number of times a tour tt^ crosses dissection line I. Arora proved that the 
length(7rj) > \ Xliine i [!]• For CVRP as T^j = OPT, this imphes 

OPT> 1/2 ^ t(7r,Z) (9) 

line / 

With probabihty a dissection hne I forms the boundary for some level £ square ^. For any 

level £ as E{c{'jt, £)) = ^ y^^^ ; t{'K, I) ■ Pi[l forms a boundary a level £ square ], we have 

2^+2 

E{c{n,£)) = — Y^tiTTj) (10) 

line I 

Proof. (Proof of Lemma 4.4) Combine equations 9 and 10 to get E{c{ir,£)) < ^^OPT. The Lemma 
follows from the fact that a level i square has side length de = □ 

Let TT denote the tours of OPT and tt^ the tours of OPT^. Arora gives a procedure to modify tt into 

the light tours tt^ [1]. But the procedure may create new crossings in tt^ with dissection squares not 
present in tt. The next claim bounds the number of these new crossings. 

Proof. (Proof of Lemma 4.5) To modify tt into tt^ Arora's procedure does bottom up patching to ensure 
that each tour crosses the edges of dissection squares at most 0{r) times. The second step is to take 
detours (along the sides of squares) to make the tours portal respecting. Both steps, patching and 
detouring, can add new crossings to tt^ which are not present in tt. A patching on edge e of square b 
adds at most 6 new crossings to each child square inside b with edge e as a boundary and each detour 
adds at at most 2 new crossings to each such child square. 

As Arora's procedure works bottom up the patching and detours taken at all levels j > £ can add 
crossings in ir^ with level £ squares. As there are 2^~^ level £ squares along the edge of a level j square, 
a patching step on the edge of a level j square adds at most 6 • 2'^^-' new crossings at level £ and each 
detour along these edges add at most 2 • 2^~^~^ new crossings at level £. Thus we have that, 

E[c{Tr^,£)] < 6 ^ 2^-^(# patching at level j) + 2 ^ 2^-J(# detours at level j) 

j<i j<e 

A patching step is performed only when a group of at least 0{r) crossings are identified in tt, thus an 
upper bound on the number of expected patching at level j is E[c{Tr,j)]/0{r). Since each crossing at 
level j can require a detour, we get, 

E[c{n^£)] < 6^2^-^M;;^ +2^2^-£;[c(.r,i)] 
j<e ^ > j<e 

Writing out the definitions of E[c{TT,j)] using equation 10 we see that for all j < £, E[c{TT,j)] = 
E[c{7r,£)]/2^~^ . Substituting this into the above equation simplifies it to, 

^;[c(.^^)]<6^^g^ + ^£;[c(.,^)] 

j<e ^ > j<e 

As£< O(logn), we have that E[c{n^,£)] < 0(logn)£'[c(7r,^)](l + 0{l/r)). This proves the Lemma 
as 0{l/r) = 0(e) is a constant. □ 



^The boundaries of level £ squares are formed by lines at levels < £ 
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D Properties of the randomized type assignment procedure 



Let b be a level i square containing points labelled type £ by Algorithm 1 and 5* be a a rounded tour 
segment inside b. Let Rg = ri,r2, . . .r^ be the interval of points labelled type ^ on 5 {\Rs\ may be 
zero) and Sa = Si, S2, ■ ■ ■ be the active points on S prior to rounding segment S. We now list some 
Properties of the rounded segment S. 

Proof. (Proof of Property 5.3) In the DP's history, S has a profile i> with a flag set to true as 5 is a 
rounded segment. Suppose that after S is concatenated according to $ it has x active points. Then 
the DP counts S* as a rounded segment having exactly ti active points for the unique threshold value U 
lying in the interval [x/(l + e/logn),a;]. To get exactly U active points on S we would need to set at 
most X — ti< x{e/\ogn) active points to L Thus \Rs\ < xe/logn while l^a] = x. □ 

Proof. (Proof of Property 5.4) Each point s E Sa belongs to \Rs\ intervals as each interval consists of 
consecutive points. There are a total of \Sa\ different intervals, each starting at a different point in 
Sa and Algorithm 3 picks uniformly among them. □ 

Definition D.l. (Length of interval) Let 61,62 be the points on the boundary of b where S enters and 
exits b. Then S visits si after entering at 61 and it visits Sx before exiting from 62 Let d{u, v) denote the 
distance between points u and v. If Rs does not contain both Si and Sx then length{Rs) = d{si, s^+i). 
Otherwise let re = Sx, then r^+i = si as Algorithm 3 wraps around and length{Ra) = YllZx d{si, Si+i) + 
d{sx, 62) + d(6i, si) + J2i=e+i d{n,ri+i). 

Proof. (Proof of Property 5.5) Let 61 and 62 be the points where S enters and exits square 6. Define 
Zx = d{bi,si) + d{sx, 62) and Zi = d{si, Sj+i), for 1 < i < a;. Then the length of Sa is J2i=i ^^'^ 

X 

E[\ength{Ra)] = Zi PT[zi is counted inside Rg] 

i=l 

For all i the probability that Zi is counted is the probability that Si and its consecutive point, Sj+i, 
are both included in Rs. (The consecutive point of s^: is si). A point s £ Sa belongs to exactly 
intervals and the consecutive point of s appears in |iis| — 1 of these intervals. Thus the Pr[2:i is counted ] = 
{\R,\ - l)/{\Sa\). Applying Property 5.3, 

i.[length(i?.)] = E^.^ = 0{^)p. = (^) length(5.) 

□ 

Proof. (Proof of Claim 5.8) Sum the lengths of intervals of type £ over all squares in Be squares, use 
Property 5.5 and linearity. Consider a square b G and the type £ points inside 6, Rb . Partition the 
points in Ri, according to the segments of 6 they come from: ri C si, r2 C S2,. ■ .rm C Sm such that 
rj is the set of points labelled type £ from tour segment Sj. By Property 5.5, in expectation over the 
random type-assignment the length of rj is at most 0(e/logn) times the length of Sj. By linearity, 
X;7-E[length(rj)] < 0(e/ logn) ^Jl^ length(sj). Let OPTf^ denote the projection of OPT^^ inside 
square 6. OPTf ^ > length(sj), and OPT^^ is at least the sum of OPTf ^ over all squares 6 e B^. 
This implies that the total length of red intervals at level £ is at most 0(e/logn)OPT^^. □ 
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